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Description 

FIELD OF THE INVENTION 

[0001] The present invention relates to a method and 
apparatus for measuring speech quality of a voice call. 
The invention is particularly related to, but in no way lim- 
ited to, measuring the speech quality of voice over in- 
ternet protocol calls using a PESQ algorithm. 

BACKGROUND TO THE INVENTION 

[0002] Voice over internet protocol (VoIP) implemen- 
tations enable voice traffic such as telephone calls and 
faxes to be carried over an internet protocol communi- 
cations network. Such implementations are advanta- 
geous because they provide lower cost long distance 
telephone calls (as compared with telephone calls made 
over public switched telephone networks for example). 
In addition, it is possible to merge data and voice com- 
munications network infrastructures thus providing 
economies of scale and increased coverage as well as 
unified messaging and other services. 
[0003] During a VoIP telephone call, the voice signal 
from a user is processed by a digital signal processor 
and then compressed before being stored in packets 
that are suitable for being transported using internet pro- 
tocol in compliance with one of the specifications for 
transmitting multimedia (voice, video, fax and data) 
across a communications network. The packets are 
transmitted across a communications network to a 
called party for example, using real time transport pro- 
tocol (RTP). When Ihe packeLs are received at their des- 
tination, the voice signal is decompressed before being 
played to a called party. The specific path that the pack- 
ets take over the communications network is not spec- 
ified and can be any suitable path that is available. Thus, 
several different VoIP calls between the same destina- 
tions may take different actual paths over the commu- 
nications network. 

[0004] Any suitable compression/decompression 
scheme is used, and these are referred to as coder-de- 
coder compression schemes (CODECs). 
[0005] One issue for packet based voice calls is how 
to provide speech quality levels that are comparable or 
better than those provided on public switched telephone 
networks. Speech quality in packet calls is affected by 
many factors such as delay, jitter, packet loss and CO- 
DEC performance. 

[0006] A need thus arises for meaningful measures of 
speech quality to be provided that are simple and inex- 
pensive to calculate and which do not themselves in- 
crease network load. For example, service providers 
may enter into contracts with customers to provide spec- 
ified levels of speech quality between specified end 
points. In order for both the service provider and cus- 
tomer to ensure that the contract is being met, a meas- 
ure of speech quality is needed. 
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[0007] Many different measures existforspeech qual- 
ity. For example, the number of packets dropped can be 
monitored and used as an indicator of speech quality. 
However, speech quality is in the end perceived by hu- 

s man users and so subjective measures of speech qual- 
ity have been developed. Mean Opinion Score (MOS) 
is one such subjective measure of speech quality which 
is obtained by obtaining judgements from a wide range 
of listeners. Those listeners hear a voice sample from a 

io particular CODEC and rate their perception of that voice 
sample on a scale of 1 (bad) to 5(excellent). These types 
of subjective tests are of course time consuming and 
costly to carry out. 

[0008] Many other measures of speech quality exist. 
15 For example perceptual speech quality measure 
(PSQM) is an objective measure of speech quality that 
is obtained by transmitting a test voice signal through a 
codec encode and decode, and then comparing the re- 
sult with the original. However, PSQM is not able to take 
20 proper account of filtering, variable delay and short lo- 
calised distortions that can occur in packet switched net- 
works, so it is not suitable for end to end speech quality 
measurement. PSQM is described in detail in Interna- 
tional Telecommunication Union (ITU) recommendation 
25 p.861 . More recently, an algorithm, known as perceptual 
evaluation of speech quality (PESQ) has been devel- 
oped, which is capable of taking proper account of fil- 
tering, variable delay and short localised distortions. 
Hence, this algorithm is appropriate for end to end 
30 measurement over packet switched networks. PESQ 
provides an estimated MOS of the speech quality and 
is the subject of draft ITU recommendation P.862. Fur- 
ther details of the PESQ alborithm are given in "PESQ 
- the new ITU standard for end-to-end speech quality 
35 assessment", Published at 109th AUDIO ENGINEER- 
ING SOCIETY Convention, 2000 September 22-25 Los 
Angeles, California, USA. Authors: Antony W. Rix, John 
G. Beerends, Michael P. Hollierand Andries P. Hekstra, 
the contents of which are incorporated herein by refer- 
40 ence. PESQ related information is also given in Interna- 
tional Patent Publication No. WO 00/22803 which de- 
scribes an apparatus for measurement of speech signal 
quality. 

[0009] When the PESQ or similar algorithms are used 
45 to measure speech quality, a dedicated voice call is set 
up to transmit only test speech signals over a commu- 
nications network. This enables the test voice signals to 
be easily identified and provides a means of determining 
the amount of degradation that occurs as a result of 
so transmission overthe network. However, it is known that 
network parameters such as packet loss and packet de- 
lay are significantly variable for many packet switched 
networks. Therefore, the results from a single test call 
over a packet swiched network cannot be assumed to 
55 reflect the speech quality between the end-points on an- 
other occasion. 

[0010] Another method of evaluating speech quality 
is referred to as the "E model" and is defined in ITU-T 
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recommendation G.107. The E model is a computation- 
al model for determining the combined effect of various 
parameters on speech quality. The model evaluates the 
end-to-end network transmission performance and out- 
puts a scalar rating "R" for the network transmission 
quality. The model further correlates the network objec- 
tive measure, "R", with the subjective QoS metric for 
speech quality, MOS. The value of R depends on a wide 
range of factors such as sending loudness rating, re- 
ceiving loudness rating, sidetone masking rating, listen- 
er sidetone rating, send side D-value of telephone, talk- 
er echo loudness rating and many other such factors. 
[0011] The ITU-T E-Model is an analytical tool for es- 
timating the speech quality of end-to-end telephone 
connections. It is primarily a transmission planning tool 
rather than a rigorous psycho-acoustic model. As such 
it is not well suited to MOS estimation on individual ses- 
sion. For example, it is known that a sudden burst of lost 
packets can seriously degrade the speech quality over 
a VOIP network. However, if there is no other packet 
loss over the duration in which the percentage packet 
loss is calculated, the percentage packet loss can be 
low enough thatthe E-model predicts a high MOS value. 
The non-linear effects associated with jitter buffering 
can also cause inaccuracy in the E-model MOS predic- 
tion. Generally, a packet arriving at a jitter buffer much 
laterthan it was expected cannot be used to regenerate 
the output speech. Hence this packet is effectively lost, 
as far as speech quality is concerned. However when 
calculating the percentage of lost packets, this packet 
is not lost, so in this case the E-model overestimates the 
speech quality. 

[0012] Earlier co-pending US patent application 
number 09/680,829 which is also assigned to Nortel 
Networks, describes a method of obtaining a measure 
of speech quality during a voice call and displaying that 
measure on the telephone handsets of the calling and 
called parties. This provides the advantage that end us- 
ers are able to see at a glance a measure of the speech 
quality of a call. In US 09/680,829 the measure of 
speech quality is obtained by transmitting dummy test 
packets which do not contain speech or voice informa- 
tion from a source server to a destination server and 
back with the aim of measuring the average packet de- 
lay and the percentage of packets lost. These parame- 
ters are then input to an E-model, in order to generate 
an estimated MOS score. This estimated MOS score 
may be output to a display unit, for example, on a tele- 
phone handset. Whilst the system and method of US 
09/680,829 are satisfactory and operable the present in- 
vention addresses additional and/or different problems. 
[0013] One problem with many previous algorithms 
for measuring speech quality is that because test pack- 
ets are sent as part of a separated IP session, they may 
take a different route through the connectionless packet 
network than the packets of the ongoing voice call. This 
means that the test packets may be degraded as a result 
of transmission through the network in a different man- 



ner than the packets of the ongoing voice call. 
[0014] WO 98/53589 describes a system for simulat- 
ing a conversation over a non-perfect communications 
link and to measure received signal quality for the sim- 

s ulated conversation. The system seeks to take into ac- 
count the reaction of users to the system's behaviour 
which can influence the way the system performs. This 
approach involves making a call for test purposes only 
and does not consider the particular problems involved 

10 for packet-based, connectionless, communications net- 
works. 

[0015] An object of the present invention is to provide 
a method of measuring speech quality of a voice call 
which overcomes or at least mitigates one or more of 

15 the problems noted above. 

[0016] Another object of the present invention is to 
generate an improved estimated MOS score which is 
suitable for output to a display unit, for example, on a 
telephone handset. In this respect, the present invention 

20 seeks to extend and develop the work described in US 
patent application no 09/680,829. 
[0017] Further benefits and advantages of the inven- 
tion will become apparent from a consideration of the 
following detailed description given with reference to the 

25 accompanying drawings, which specify and show pre- 
ferred embodiments of the invention. 

SUMMARY OF THE INVENTION 

30 [0018] The invention provides a method of enabling 
speech quality to be measured as defined in claim 1 ; a 
method of measuring speech quality for a call as defined 
in claim 5; a packet-based communications network 
node as defined in claim 11 and as defined in claim 13; 
35 and a signal for a voice call as claimed in claim 8. 
[001 9] According to an example of the present inven- 
tion there is provided a method of measuring the speech 
quality of a voice call between a first node and a second 
node in a packet-based communications network. Each 
40 of thefirst and second nodes comprises the same stored 
test voice information and the method comprises the 
steps of, at the first node: 

• receiving packets for the voice call and adding at 
45 least part of the stored test voice information to at 

least some of the packets; 

• forwarding the packets to the second node; and 

so • at the second node, accessing the test voice infor- 
mation stored at the second node and comparing it 
with the test voice information received in the pack- 
ets using a speech quality assessment algorithm in 
order to obtain a measure of speech quality for the 

55 voice call. 

[0020] For example, each of the first and second 
nodes have the same pre-stored test vectors comprising 
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test voice information. The first node sends these test 
vectors to the second node as part of a live voice call. 
The second node receives the test vectors and is able 
to compare them with the pre-stored test vectors to de- 
termine how much degradation has taken place as a re- 
sult of transmission through the network. 
[0021] This provides the advantage that because the 
test voice information is sent as part of a live voice call 
itself, any degradation experienced by the test voice in- 
formation is closely associated with that experienced by 
actual voice information in the voice call itself. This en- 
ables a measure of speech quality to be obtained for the 
particular voice call. In contrast to previous methods, 
which transmit test packets in order to measure the 
packet loss percentage and then derive an estimate of 
thespeech quality MOS, this method derives the speech 
quality estimate from test speech that is embedded in 
the voice call itself. 

[0022] Another advantage is that because the speech 
quality measure is specific to a particular call, it is pos- 
sible to relate user reported issues to an exact quanti- 
tative measurement for the exact call in which the user 
has experienced those issues. 
[0023] In an example, some of the packets received 
at the first node comprise voice information associated 
with the voice call and others of those packets are as- 
sociated with periods when speech is absent from the 
voice call. In that case, said step (i) further comprises 
identifying those packels which are associated with pe- 
riods when speech is absent from the voice call and add- 
ing test voice information to one or more of those pack- 
ets. This enables the test vectors to be incorporated into 
a live voice call wiLhoul disrupLing or otherwise adverse- 
ly affecting that live voice call. The test vectors are in- 
corporated into "silent periods" in the live voice call (i.e. 
periods during which no speech takes place). 
[0024] For example the packet-based communica- 
tions network is an internet protocol communications 
network. However, this is not essential, other types of 
packet-based communications network may be used 
such as wireless local area network (LAN), global sys- 
tem for mobile communications (GSM) or third genera- 
tion (3G) networks. The invention is especially useful in 
packet-based communications networks where packet 
loss is a significant problem. 

[0025] In one example, the method further comprises 
making an indication in a header of each of those pack- 
ets to which test voice information is added. This ena- 
bles the second node to identify packets containing test 
voice information. If the second node is at an endpoint 
the test voice packets are separated from the packets 
containing the "live" voice information. The "live" voice 
information packets are forwarded to a CODEC and 
processed as is known in the art. 
[0026] For example, the indication is a payload value 
and the packets are real-time transport protocol (RTP) 
packets. Advantageously, the RTP protocol provides 
that some payload values may be user defined. Payload 



values can then be used to enable the second node to 
identify those packets which contain test voice informa- 
tion in a manner which requires no changes to be made 
to the existing RTP protocol and enables existing net- 
s work equipment that is configured for use with RTP to 
be used. 

[0027] In one example, the packets are forwarded 
from the first node to the second node via one or more 
other nodes which do not have access to information 
10 about the pre-specified identifier. For example, commu- 
nications network nodes which have no knowledge of 
the particular user defined payload value for identifying 
test voice packets, simply forward those packets as they 
would do for any other voice packets. Existing protocols 
15 such as RTP are arranged to do this and this provides 
the advantage that information about the user defined 
payload value only needs to be provided to those net- 
work nodes at which it is required to make speech qual- 
ity assessments. 
20 [0028] In one embodiment the first and second nodes 
are located substantially at the edge of the communica- 
tions network. This provides the advantage that speech 
quality assessments are made without needing to adjust 
or adapt core network nodes in any manner. However, 
25 this is not essential. If speech quality assessments are 
required at the core of the network, the first and or sec- 
ond nodes may be at the core of the network, 
[0029] In an example the speech quality assessment 
algorithm Is a PESQ algorithm. This provides the advan- 
ce tage that an estimated MOS score is provided for a "live" 
voice call that is determined using test voice information 
that has been transmitted integrally with that "live" voice 
call. 

[0030] According to another example of the present 

35 invention there is provided a signal for a voice call pro- 
vided over a packet-based communications network, 
said signal comprising a plurality of packets at least 
some of which comprise test voice information. For ex- 
ample, the voice call is a live voice call and as described 

40 above, because thetestvoice information is transmitted 
integrally with the live voice call, that test voice informa- 
tion can then be used to determine an accurate assess- 
ment of the quality of the live voice call. 
[0031] In an embodiment some of the packets are as- 

45 sociated with periods when speech is absent from the 
voice call and comprise test voice information. This en- 
ables test voice information to be transmitted integrally 
with a live voice call, without affecting that live voice call. 
For example, the packets are real-time transport proto- 

so col packets and some of the packets comprise a header 
with an indicator, indicating that those packets comprise 
test voice information. This enables a network node 
which receives the signal to identify those packets which 
contain test voice information. 

55 [0032] According to another example of the invention 
there is provided a packet-based communications net- 
work node arranged to enable speech quality to be 
measured for a voice call which is ongoing between a 
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caller and a called party said node comprising: 

• an input arranged to receive packets for the voice 
call; and 

• a processor arranged to add test voice information 
to one or more of the packets; 

• an output arranged to forward the packets towards 
the called party. 

[0033] For example, the network node receives pack- 
ets from a CODEC, adds test speech which has been 
encoded with a similar but separate CODEC to some of 
those packets and forwards the packets to the called 
party. This enables test voice information to be transmit- 
ted integrally with a live voice call. 
[0034] According to another example of the present 
invention there is provided a packet-based communica- 
tions network node arranged to measure speech quality 
for a call which is ongoing between a caller and a called 
party, said node comprising: 

• an input arranged to receive packets as part of the 
voice call some of which comprise voice information 
associated with the voice call and some of which 
comprise received test voice information; 

• stored test voice information; 

• a processor arranged to compare the received test 
voice information and the stored test voice informa- 
tion using a speech quality assessment algorithm 
in order to obtain a measure of speech quality for 
the voice call. 

[0035] This communications network node may be lo- 
cated at the core or at the edge of the communications 
network depending on where it is required to obtain an 
estimate of speech quality. 

[0036] According to another example of the present 
invention there is provided a method of measuring 
speech quality for a call which is ongoing, said method 
comprising, at a node in a packet based communica- 
tions network: 

• receiving packets as part of the voice call some of 
which comprise voice information associated with 
the voice call and some of which comprise received 
test voice information; 

• accessing stored test voice information; 

• comparing the received test voice information and 
the accessed stored test voice information using a 
speech quality assessment algorithm in orderto ob- 
tain a measure of speech quality for the voice call. 



[0037] According to another example of the present 
invention there is provided a method of enabling speech 
quality to be measured for a voice call which is ongoing 
between a caller and a called party said method com- 
s prising, at a node in a packet based communications 
network: 

• receiving packets for the voice call; 

10 • adding test voice information to one or more of the 
packets; and 

• forwarding the packets towards the called party. 

15 [0038] According to another example of the present 
invention there is provided a computer program for con- 
trolling a packet-based communications network node 
in order to enable speech quality to be measured for a 
voice call which is ongoing between a caller and a called 

20 party said computer program being arranged to control 
the node such that: 

• packets for the voice call are received; 

25 • testvoiceinformationisaddedtooneormoreofthe 
packets; and 

• the packets are forwarded towards the called party. 

30 [0039] The computer program may be stored on a 
computer readable medium. 

[0040] According to another example of the present 
invention there is provided a computer program ar- 
ranged to control a packet-based communications net- 
35 work node in order to measure speech quality for a call 
which is ongoing between a caller and a called party, 
said computer program being arranged to control the 
node such that: 

40 • packets are received as part of the voice call some 
of which comprise voice information associated 
with the voice call and some of which comprise re- 
ceived test voice information; 

45 • test voice information stored at the node is ac- 
cessed; and 

• the received test voice information and the stored 
test voice information are compared using a speech 

so quality assessment algorithm in order to obtain a 
measure of speech quality for the voice call. 

BRIEF DESCRIPTION OF THE DRAWINGS 

55 [0041] In order to show how the invention may be car- 
ried into effect, embodiments of the invention are now 
described below byway of example only and with refer- 
ence to the accompanying figures in which: 
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Figure 1 is a schematic diagram of a packet-based 
communications network comprising communica- 
tions network nodes modified for use in the present 
invention; 

Figure 2 is a schematic diagram of two of the com- 
munications network nodes of Figure 1 in more de- 
tail; 

Figure 3 is a flow diagram of a method carried out 
by one of the communications terminals nodes of 
Figure 2; 

Figure 4 is a flow diagram of a method carried out 
by the other communications network node of Fig- 
ure 2: 

DETAILED DESCRIPTION OF INVENTION 

[0042] Embodiments of the present invention are de- 
scribed below by way of example only. These examples 
represent the best ways of putting the invention into 
practice that are currently known to the Applicant al- 
though they are not the only ways in which this could be 
achieved. 

[0043] Figure 1 is a schematic diagram of a packet- 
based communications network comprising communi- 
cations network nodes (A, B, C) modified for use in the 
present invention. At nodes A, B and C, test voice infor- 
mation is stored which is the same at each node. For 
example, this test voice information comprises test vec- 
tors. Other nodes D, E, F and G do not have this stored 
test voice information, A user terminal 10 is shown con- 
nected to node A and another user terminal 1 2 connect- 
ed to node B. The user terminal 12 connected to node 
B also has the stored test voice information whilst the 
other user terminal 1 0 does not. 
[0044] Figure 2 shows the structure of nodes A and B 
in more detail. Both node A and node B comprise a 
memory with stored test voice information 21 such as 
test vectors. In addition nodes A and B each have a 
processor 22, 23 which is arranged in a particular way. 
In this example, node A's processor 23 is arranged to 
add test vectors to an ongoing voice call whilst node B's 
processor 22 is arranged to carry out a speech quality 
algorithm. It is possible for nodes A and B to have iden- 
tical processors which are arranged to carry out both 
these functions. 

[0045] Consider a voice call from terminal 10 to ter- 
minal 12. This call passes from node A to node B and 
the actual packets of the call may travel via different 
routes between those two nodes. For example, from A 
to E, F, G and then B or from A to C to B. The voice call 
is achieved using voice over internet protocol technolo- 
gy or any other suitable method for achieving a voice 
call over a packet-based communications network as is 
known in the art. For example, an internet protocol com- 
munications network is used with an RTP session being 
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set up between A and B for the voice call. 
[0046] When a voice call is ongoing between terminal 
10 and terminal 12, node A is arranged to add stored 
test voice information to some of the packets of that call. 

s Preferably, node A comprises a processor which is ar- 
ranged to identify silent periods during the voice call and 
to add packets comprising test voice information to the 
call duringthose silent periods. Note thatthetest speech 
must also pass through a codec, if one is used. In this 

10 way, some of the packets comprise only test voice infor- 
mation whilst other packets comprise only "real" voice 
information for the live voice call. However, this is not 
essential. The test voice information may be incorporat- 
ed into packets which comprise "real" voice information 

15 as long as means is provided to enable node B to dis- 
tinguish between these two types of information. 
[0047] For packets comprising test voice information, 
node A is arranged to include an indicator in the header 
of those packets to indicate that they comprise test voice 

20 information . For example, if the packets conform to the 
RTP protocol, this indicator may be a pre-specified pay- 
load type value. Node B has knowledge of this pre-spec- 
ified payload type value in orderthat it is able to separate 
the test voice information packets from the packets con- 

25 taining real voice information. However, it is not essen- 
tial to use a payload type value as the indicator. Any 
other suitable type of indicator may be used. 
[0048] Node A transmits the packets for the voice call , 
including those comprising test voice information, to 

30 node B in the usual manner as specified by the particular 
protocol being used for the call (for example, RTP). 
These packets follow any of the possible routes be- 
tween A and B and in doing so may pass through nodes 
which do not have any knowledge of the indicator used 

35 to identify those packets comprising test voice informa- 
tion (for example, nodes D, E, F and G in Figure 1). 
Those nodes are arranged to simply ignore any such 
identifiers and forward the packets in the normal man- 
ner. For example, in the case that RTP is used and the 

40 indicator is a payload type value, nodes which encoun- 
ter an unknown payload type value are arranged to for- 
ward those packets and take no further action. 
[0049] Individual packets forthe voice call between A 
and B may take different routes between A and B as 

45 explained above. This means that the packets compris- 
ing test voice information may follow different routes 
from the packets comprising the voice information for 
the ongoing voice call. However because the packets 
are all sent as part of the same voice call (for example, 

so as part of the same RTP session), the test voice infor- 
mation packets experience approximately the same ef- 
fects from transmission through the network as do the 
real voice information packets. This provides the advan- 
tage that an improved assessment of the amount of deg- 

55 radation experienced by the voice call is obtained. Pre- 
vious methods that have used dummy test packets 
(which contain no test or real speech information) to 
measure percentage packet loss provide a differenttype 
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of assessment. Other types of previous method have 
used dedicated calls for test speech to enable end to 
endtesting. In that case the test speech does not enable 
an accurate assessment of a particular voice call as in 
the present invention. In addition, many dedicated voice 
terminals can handle only one call at a time, so a sepa- 
rate call for test speech is not possible. 
[0050] Node B receives the packets and using its 
knowledge of the identifier is able to separate the re- 
ceived test voice information from the "real" voice infor- 
mation. The received test voice information is input to a 
speech quality assessment algorithm together with the 
stored test voice information, stored at node B. The 
speech quality assessment algorithm produces a meas- 
ure of the speech quality of the particular voice call. For 
example, this may comprise an estimated MOS score. 
Any suitable speech quality measurement algorithm 
may be used as described above although, in a pre- 
ferred embodiment the PESO algorithm is used. The 
speech quality measurement algorithm used needs to 
be able to generate an estimate of speech quality by 
comparing test speech signals with speech signals that 
have been transmitted over a packet switched network 
and may have been subject to effects such as filtering 
and variable delay. 

[0051] Once a measure of the speech quality of the 
particular voice call is obtained this information is pro- 
vided to a user (such as the network operator, service 
provider and or end user) in any suitable manner. For 
example, the measure may be displayed on a display 
screen at terminal 10 and terminal 12. The information 
may also be sent to a network management system. 
This enables a service provider to monitor the quality of 
service being provided and to make adjustments to the 
network as necessary. The measure is preferably pro- 
vided in real time and is directly related to a specific call 
as described above. 

[0052] It is not essential for node A in Figure 1 to add 
test voice information to all calls from node A. For ex- 
ample, only 5% of calls may be assessed for speech 
quality using the method described herein. If low levels 
of speech quality are detected this percentage can then 
be increased. 

[0053] It is also possible for a terminal itself to carry 
out the functions of nodes A and or B. For example, node 
12 in Figure 1 is shown as having stored test vectors. 
[0054] Figure 3 is a flow diagram of a method carried 
out by node A in Figure 1 or of any other suitable node 
which issues testvoice information (test vectors). At that 
node, the speech signal is broken into sections equal to 
the appropriate packet length, in the usual way. (see box 
30). 

[0055] Speech sections containing voice activity are 
detected using a voice activity detector in the usual way 
(see box 31). Speech sections containing voice activity 
are passed to the codec (if one is used). During speech 
sections containing silence, speech sections from the 
test vectors are passed through a separate, identical co- 
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dec (if one is used), and that codec's output is transmit- 
ted in place of the silence (see box 32). This results in 
some packets containing test vectors and no voice ac- 
tivity and other packets containing voice activity and no 

s test vectors. However, this is not essential. Some pack- 
ets may contain both test vectors and actual voice ac- 
tivity, provided that means is provided for later identify- 
ing the two types of information. 
[0056] Note that although packets containing the test 

10 vector have thus been embedded into the packet 
stream, they are identified with a different payload type 
in the packet header. All the packets are then forwarded 
to the same destination (see box 33). 
[0057] Figure 4 is a flow diagram of a method carried 

15 out by node B in Figure 1 or of any other suitable node 
which receives testvoice information and carries out a 
speech quality assessment algorithm. The node re- 
ceives packets comprising voice information and pack- 
ets comprising received test vectors (see box 40 of Fig- 

20 ure 4) and can identify each from the packet header. 
[0058] Packets identified as part of the voice stream 
are sent to decode CODEC for conversion to analogue 
signal and playing to a user (see box 41) in the usual 
way. Packets identified as part of the test vector stream 

25 are sent to the test vector decode CODEC and then to 
the speech quality algorithm (see box 42) for MOS es- 
timation. Thus the test speech passes through codec 
encode and decode. This means that the MOS score 
includes codec effects. 

30 [0059] In orderto take account of degradation that oc- 
curs as a result of CODEC processing, the test speech 
is also processing using a CODEC in the same way as 
the actual speech. A separate CODEC is used for the 
test speech in order that the CODEC used for the real 

35 speech is not affected. Preferably the additional CO- 
DEC forthe test speech is operated during periods when 
the CODEC for the real speech is inactive. In this way 
real-time requirements are not affected. 
[0060] Any range or device value given herein may 

40 be extended or altered without losing the effect sought, 
as will be apparent to the skilled person for an under- 
standing of the teachings herein. 
[0061] A range of applications are within the scope of 
the invention. These include situations in which it is re- 

45 quired to assess the speech quality of a voice call over 
a packet-based communications network. 

Claims 

50 

1. A method of enabling speech quality to be meas- 
ured for a voice call which is ongoing between a call- 
er and a called party said method being character- 
ised in that, at a node (A) in a packet based com- 
55 munications network the network comprises: 

(i) receiving packets for the voice call; 
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